TRECVID 2008 Participation by MCG-ICT-CAS
نویسندگان
چکیده
For TRECVID 2008 concept detection task, we principally focus on: (1) Early fusion of texture, edge and color features TECM, abbreviation of the combined TF*IDF weights based on SIFT features, Edge Histogram, and Color Moments. (2) To improve the training efficiency and explore the knowledge between concepts or hidden sub-domains more easily and efficiently, we propose a novel method based on Latent Dirichlet Allocation (LDA): LDA-based multiple-SVM (LDASVM). We first use LDA to cluster all the keyframes into topics according to the maximum element of the topic-simplex representation vector (TRV) of each keyframe. Then, we train the annotated data in each topic for each concept. During training, unlike multi-bag SVM, we only use positive samples in current topic for the sake of retaining sample’s separability, instead of all positive samples among the whole training set, and ignore the topics with too few positive samples. While testing a keyframe for a given concept, we adopt TRV as the weight vector, instead of equal weighting strategy, to combine the SVM outputs of topic-models. (3) Introduction of Pseudo Relevance Feedback (PRF) into our concept detection system for the purpose of making re-trained models more adaptive to the test data: unlike existing PRF techniques in text and video retrieval, we propose a preliminary strategy to explore the visual features of positive training samples to improve the quality of pseudo positive samples. Experimental results demonstrate that our proposed LDASVM approach is both effective and efficient.
منابع مشابه
TRECVID 2008 Event Detection By MCG-ICT-CAS*
As for Event Detection in TRECVID 2008, we develop a surveillance system with two parts, the trajectorybased sub-system and the domain knowledge-based sub-system. The former focuses on the research of general methods for event discovery. Human detection and tracking are utilized to generate the trajectory and then novel three-level trajectory features are proposed to detect PersonRuns, PeopleMe...
متن کاملTRECVID 2007 High-Level Feature Extraction By MCG-ICT-CAS
We participated in the high-level feature extraction task in TRECVID 2007. This paper describes the details of our system for the task. For feature extraction, we propose an EMD-based bag-of-feature method to exploit visual/spatial information, and utilize WordNet to expand semantic meanings of text to boost up the generalization of detectors. We also explore audio features and extract the moti...
متن کاملKnown-Item Search by MCG-ICT-CAS
This paper describes the highlights of known-item search system for TRECVID 2010. We first propose that there lies Understanding Gap between a video’s author and user, which gap has been represented in the author labeled semantic text(sText) description and user generated visual text(vText) query. To bridge this gap, we explore the structured online knowledge from Wikipedia and the data-driven ...
متن کاملActive Cleaning for Video Corpus Annotation
In this paper, we have described the active cleaning approach that was used to complement the active learning approach in the TRECVID collaborative annotation. It consists in using a classification system in order to select the most informative samples for multiple annotations, in order to improve the quality and the reliability of the annotations. We have evaluated the actual impact of the act...
متن کاملTHU and ICRC at TRECVID 2008
High level feature extraction ID MAP Training set Testing set Brief description run1 0.116 LIG & CAS 1frame/shot & 3frame/shot baseline+keypoint run2 0.123 LIG & CAS 3frame/shot baseline run3 0.057 CAS & Flickr & Peekaboom 1frame/shot & 3frame/shot trecvid+flickr+peekaboom run4 0.103 CAS & Flickr & Peekaboom 1frame/shot & 3frame/shot borda fusion for run3 and run2 run5 0.080 CAS 1frame/shot key...
متن کامل